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ABSTRACT 


This  paper  describes  the  formation  of  a  dictionary  list  structure 
vhlch  can  be  used  by  an  English  language  translator  to  enable  natural 
language  man-machine  conversation  directed  tovards  the  retrieval  of 
Information  from  a  data  bank.  The  hierarchical  structure  of  the  let¬ 
ters  in  a  word  and  the  placement  of  word  attributes  in  this  structure 
is  discussed. 

A  computer  program,  vhlch  accepts  as  input  an  English  language 
sentence  and  processes  this  sentence  in  conjunction  with  the  dictionary 
list  structure  to  obtain  the  attributes  of  the  individual  words,  is 
described.  The  incorporation  of  this  dictionary  structure  into  a 
complete  natural  language  information  retrieval  system  is  also 


discussed. 
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LIST  OF  DEFINITIONS 


1.  Record  (Logical  Record)  -  A  collection  of  related  d*ita  elements 
treated  as  a  unit.  (E.g.,  the  topic  name  and  its  numeric  code  word). 

2.  File  -  A  collection  of  related  records  treated  as  a  unit. 

3.  Information  Retrieval  System  -  A  process  developed  to  recover 
specific  information  from  a  data  bank. 

4.  Key-Searches  -  Searches  conducted  to  retrieve  information  in 
accordance  with  a  specific  data  input  parameter  (or  set  of  parameters) 
called  a  key. 

5.  Pointer  -  A  numeric  value  used  to  specify  an  address  in  core 

6.  Null  Pointer  -  A  special  pointer  value  that  cannot  relate  to  any 
address  in  storage  (represented  by  NIL  or^). 

7.  Node  -  One  or  more  consecutive  bytes  of  core  storage  divided  into 
parts  called  fields,  (synonym  —  cell) 

8.  Tree  -  A  finite  set  of  one  or  more  nodes  such  that  there  is  one 
specially  designated  node  called  the  root  of  the  tree  and  the  remaining 
nodes  are  partitioned  into  disjoint  sets  of  trees  which  are  called 
subtrees  of  the  original  tree. 

9.  Binary  Tree  -  A  tree  structure  where  each  node  has  at  most  two 
subtrees.  When  there  is  only  one  subtree  present,  a  distinction  is 
made  between  whether  it  is  a  left  subtree  or  a  right  subtree. 

10.  Hash  Coding  -  A  method  of  effecting  random  storage  where  a  trans^- 
formation  is  performed  on  a  key  (in  this  case  the  topic  name)  to 
produce  an  address  in  the  table  which  can  be  used  to  locate  a  position 
for  placing  the  key  and  xcs  associated  item. 
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I.  INTRODUCTION 


The  Technical  Reports  Section  at  the  Naval  Postgraduate  School 
Library  currently  offers  its  users  access  to  those  reports  in  its 
holdings  by  either  of  two  methods.  First,  the  user  can  search  through 
the  card  index  to  obtain  the  report  number  of  the  item  desired,  or, 
second,  he  can  conduct  a  computerized  library  search  to  obtain  a  bib¬ 
liography  of  those  reports  in  the  holdings  which  pertain  to  the  sub¬ 
ject  area  of  interest. 

This  library  search  is  conducted  by  entering  a  numeric  code  word 
into  an  information  retrieval  system  which  will  then  use  this  code 
word  as  a  key  to  search  for  and  retrieve  a  bibliographical  listing 
of  those  reports  which  pertain  to  the  entered  code  word.  Any  number 
of  code  words  can  be  combined,  and  the  items  on  the  bibliography  re¬ 
trieved  for  the  user  will  have  all  of  the  code  words  within  their  list 
of  keys.  At  present  the  Technical  Reports  Section  has  about  100,000 
bibliographical  items  in  its  file,  and  these  items  can  be  accessed 
by  over  10,000  different  code  words. 

Although  a  library  search  is  faster  and  easier  for  the  user,  es¬ 
pecially  when  only  a  general  topic  area  is  desired,  it  does  have  a 
number  of  shortcomings.  First,  there  is  a  nominal  delay  between  the 
request  for  the  search  and  the  obtaining  of  the  bibliography  resulting 
from  that  search.  Second,  the  user  does  not  know  at  the  time  of  his 
request  whether  or  not  is  topic  area  is  too  broad  or  too  narrow  to 
obtain  a  reasonable  bibliography.  Third,  the  user  presently  has  to 
personally  search  through  a  book  of  topic  areas  to  obtain  the  numberlc 
code  words  required  for  the  information  retrieval  system. 
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mis  report  is  the  result  of  a  project  to  develop  a  flexible  and 
accersible  information  system  for  the  specific  objective  of  retrieving 
a  bibliography  of  the  reports  held  by  the  Technical  Reports  Section 
of  the  Naval  Postgraduate  School  Library.  The  proposed  system  would 
be  implemented  on  the  school* s  IBM  360/67  computer  and  would  utilize 
the  time-sharing  capability  of  that  system. 

The  system  as  conceived  would  be  similar  to  the  REAL  ENGLISH  system 
at  the  University  of  Pennsylvania  [  1  ] .  It  would  consist  of  an 
English  language  translation  capability  which  would  be  used  to  pre- 
process  the  user's  English  language  request,  which  has  been  entered 
at  the  time-sharing  terminal,  to  determine  the  specific  item(s)  desired. 
Then  the  numeric  code  associated  with  each  item  would  be  retrieved  from 
a  table  and  entered  into  the  existing  Information  retrieval  system. 

The  number  of  items  in  the  bibliographical  file  which  met  the  criterion 
of  the  code  word(s)  would  be  printed  out  at  fhe  terminal,  and  the  user 
could  then  specify  whether  or  not  there  was  a  sufficient  number  of 
bibliographical  items  for  his  use.  If  the  number  of  items  were  not 
sufficient  or  overly  abundant,  the  user  could  re-initiate  his  request 
in  either  a  broader  topic  area  or  a  more  restricted  area  to  obtain 
the  desired  number  of  items  in  the  bibliography.  If  the  number  of 
bibliographical  items  was  sufficient,  the  user  could  specify  that  the 
bibliography  was  tu  be  printed  out  at  the  terminal  or  at  the  offline 
printer  at  the  computer  installation.  A  generalized  block  diagram 
of  the  proposed  system  is  shown  in  figure  1,  page  11. 

The  remainder  of  this  report  has  been  written  in  two  parts.  The 
first  part  describes  the  formation  of  a  dictionary  structure  required 
for  the  English  language  translation,  the  processing  of  the  user's 
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RE-INITIATE 
THE  REQUEST 


request  to  obtain  from  the  dictionary  the  attributes  of  the  Individual 

j 

words,  and  the  retrieval  of  the  numerid  codes  from  the  table  of  code 
words.  The  second  part  consists  of  illustrative  examples  of  the 
system. 

The  syntactic  and  semantic  interpretation  of  the  attributes  of 

■  I 

the  request  and  the  actual  implementation  with  the  existing  informa- 

! 

tion  retrieval  system  are  left  for  further  study.  Some  work  in  the 

i  r 

area  of  sjyntactic  and  semantic  interpretation  has  been  done  by  Cautin 
[1,2]  and  Kellog  [5,6]  which  might  be  of  value  for  anyone  who  desires 
to  conduct  further  work  on  this  system. 
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II.  SYSTEM  DESIGN 


At  this  stage  in  the  development  of  the  system,  there  are  three 
main  computer  programs  or  subroutines.  The  primary  program,  LIB-SER, 
which  is  mainly  concerned  with  performing  operations  on  lists,  was 
written  in  PL/'I  [7,8,9]  to  take  advantage  of  the  list  processing  and 
recursive  capabilities  of  that  language.  Also,  PL/I  was  chosen 
instead  of  a  list  processing  language,  such  as  LISP  1.5,  because  of 
the  availability  of  PL/I  at  the  Naval  Postgraduate  School  computer 
installation. 

The  TABLE  program,  which  is  also  in  PL/I,  was  written  exclusively 
for  use  as  a  calling  program  for  the  construction  and  maintenance  of 
the  numeric  codes  table.  PL/I  was  chosen  for  this  routine  mainly 
because  of  its  character  manipulation  capabilities. 

The  final  program,  HASHX,  is  a  subroutine  written  in  IBM  360/67 
Assembly  language  [10,11]  and  is  called  into  operation  by  both  the 
LIB-SER  and  TABLE  programs.  Assembly  language  was  chosen  since  it 

r» 

allows  a  closer  control  over  the  representation  and  location  of  data 
while  in  computer  core  storage.  This  close  control  is  necessary 
since  HASHX  is  used  to  move  the  core  character  representations  of  the 
topic  names  and  their  numeric  codes  from  one  position  to  another  in 
core  storage. 

A.  LIB-SER  PROGRAM 

The  LIB-SER  program  is  the  main  program  utilized  by  the  proposed 
library  search  system.  It  is  utilized  to  build  lists,  add  Items  to 
lists,  delete  items  from  lists,  and  search  for  items  on  the  lists 


13 


maintained  during  the  operation  of  the  proposed  system.  The  lists 
of  prime  Importance  are  ATOMLIST,  the  dictionary  list  (LI) ,  the  user's 
request  list  (L2) ,  and  the  multiple  definitions  list  (L3) . 

1.  List  Structure  in  the  Program 

A  list  structure  vas  used  in  this  program  since  a  hierarchical 
structure  was  desired  for  the  dictionary  list.  This  hierarchical 
structure  is  used  to  simplify  the  search  through  the  dictionary  for 
the  attributes  of  the  words  in  the  user's  request.  The  inputs  to  the 
program,  the  dictionary  file  and  the  user's  request,  are  built  into 
a  system  of  connected  lists  where  these  lists  take  on  a  form  similar 
to  the  program  and  data  stricture  utilized  in  the  list  processing 
language  LISP  1.5  [12]. 

In  LISP  1.5,  program  and  data  are  represented  in  the  form  of 
symbolic  expressions  or  S-expressions  where  these  S-expressions  are 
of  indefinite  length  and  have  a  branching  binary  tree  structure  which 
allows  the  identification  and  isolation  of  sub-expressions  or  sub¬ 
trees  within  the  basic  structure.  The  S-expressions  are  graphically 
represented  in  this  binary  tree  structure,  as  shown  in  figure  2  on 
page  15,  as  a  sequency  of  nodes  or  cells,  each  of  which  has  two  fields. 
When  using  the  dotted  pair  notation  in  LISP  1.5,  both  of  the  fields  can 
be  used  to  contain  a  pointer  to  another  node  or  cell  or  they  can  be 
used  to  contain  a  character  string.  In  an  alternative  notation,  list 
notation,  the  left  field  is  the  only  field  which  can  be  used  to  contain 
a  character  string  or  pointer.  The  right  field  contains  a  pointer 
to  the  next  node  in  the  tree  or  a  special  pointer,  NIL,  which  is  used 
as  a  terminator  of  the  list. 
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C  (NIL 


^A-(B-C))  (A(B  C)) 

(a)  <b) 

dotted  pair  notation  list  notation 

FIGURE  2.  Graphical  Structure  of  LISP  1.5  Lists 

'  •  k:  I 

The  list  structure  used  in  this  program  is  a  modified  version 
of  the  list  notation  used  in  LISP  1.5.  This  modification  was  required 

by  the  use  of  PL/I  as  the  programming  language  and  the  requirement  of 

I 

I 

this  language  that  the  contents  of  the  fields  in  a  cell  be  explicitly 
declared.  Thus,  no  one  field  can  be  used  to  contain  both  a  pointer 
and  a  character  string  during  ai  program  execution.  This  required  that 
the  character  strings  be* placed  in  a  type  of  cell,  called  ATOMCELL  in 

this  program,  defined  separately  from  the  connective  cells,  called 

i 

CON CELL,  used  to  represent  the  tree  structure.  It  also  required,  as 

|  i 

shown  in  figure  3  on  page  16,  that  each  of  the  character  strings  which 

J 

is  placed  in  an  ATOMCELL  to  have  associated  with  it  two  CONCELLS. 

The  upper  CONCELL  is  used  to  connect  the  character  string  to  other 

| 

character  strings  at  the  same  level  in  the  subtree.  The  lower  CONCELL 
is  used  to  contain  a  pointer  to  the  character  string  contained  in  the 
ATOMCELL  and  serve  as  an  attachment  point  for  any  subtrees  of  that 
character  string. 

( 
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Since  there  are  two  CONCELLS  associated  with  each  character 
string,  a  method  of  determining  whether  or  not  the  left  field  of  the 
CONCELL  pointed  to  an  ATOMCELL  or  to  another  CONCELL  in  the  list  was 
required.  This  was  accomplished  by  the  addition  of  a  third  field  to 
the  CONCELL  type  of  cell.  A  fourth  field  was  added  to  indicate  the 
depth  of  the  occurrence  of  the  cell  In  the  list  structure  and  Is 
utilized  during  the  construction  of  the  trees. 


(A<BC>  ) 

FIGURE  3.  Graphical  Structure  of  the  Lists  Used.* 


a.  ATOMLIST  List 

The  ATOMLIST  list  is  used  to  store  the  alphabetic,  numeric, 
and  special  characters  which  are  required  by  the  program  to  be  retained 
in  their  symbolic  form  during  the  operation  of  the  program.  It  con¬ 
sists  of  a  series  of  connected  cells  called  ATOMCELLS,  shown  in  figure 
4  on  page  17,  which  have  three  fields.  Each  new  cell  added  to  the  list 
is  connected  to  the  first  cell  on  the  list. 

*A11  of  the  circled  character  strings  are  included  in  the  ATOMLIST 
list  and  only  conceptually  displayed  here.  The  7\  or  X  used  in  the 
CDRF  field  is  used  to  indicate  the  NULL  pointer. 
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The  three  fields  are  the  SIZE  field,  which  contains  the 
number  of  characters  of  the  Item  represented  In  the  cell,  the  PNAME 
field,  which  contains  the  character  representation  of  the  Item,  and 
the  NEXT  field,  which  contains  a  pointer  to  the  next  ATOMCELL  on  the 
list. 


FIGURE  4.  Structure  of  an  ATOMCELL 

Each  of  these  ATOMCELLs  Is  placed  on  this  list  as  the 
characters  they  contain  are  encountered  during  the  operation  of  the 
program,  with  the  majority  of  them  being  attached  to  the  list  during 
the  initialization  phase  of  the  dictionary.  Each  Item  Is  Included 
In  this  list  only  once  since  a  search  of  the  list  (procedure  LOOKUP) 
la  conducted  every  time  an  item  Is  encountered  to  determine  if  an 
ATOMCELL  has  or  has  not  already  been  Included  in  the  list  which  con¬ 
tains  the  character(s)  of  the  item.  No  ATOMCELLs  are  deleted  from 
this  list  during  the  operation  of  the  program. 

All  referrals  to  the  character  representation  of  any  item 
on  the  AIOMLIST  is  by  the  pointer  to  the  eatire  ATOMCELL.  This  pointer 
will  be  the  same  as  that  contained  in  the  NEXT  field  of  the  ATOMCELL 
Immediately  prior  to  the  ATOMCELL  containing  the  desired  characters, 
if  one  exists,  or  else  it  will  be  the  pointer  ATOMLIST  since  the  charac¬ 
ters  would  then  have  to  be  contained  in  the  first  ATOMCELL  on  the  list. 
A  diagram  of  the  ATOMLIST  Is  shown  below  in  figure  5. 


FIGURE  5.  Diagram  of  the  ATOMLIST 


b.  Connective  Lists 

All  of  the  remaining  lists  built  and  used  by  the  program 
are  used  to  connect  the  character  representations  In  the  ATOMLIST 
In  various  fashions.  These  lists  consist  of  a  scries  of  CONCELLS 
which  have  four  fields,  an  ATOMB  field,  a  LEVEL  field,  and  two  pointer 
fields,  CAEF  and  CDRF.  The  structure  of  the  CONCELL  is  shown  in  figure  6. 


ATOMB 

LEVEL 

CARF 

CDRF 

FIGURE  6.  Structure  of  a  CONCELL 

The  ATOMB  field  is  used  to  indicate  the  purpose  of  the 
CONCELL.  It  is  a  two  bit  field  which  can  contain  one  of  three  binary 
values,  00,  01,  or  11.  The  00  value  indicates  that  the  CONCELL  is 
being  used  as  a  connective  cell  where  both  the  CARF  and  the  CDRF  fields 
contain  pointers  to  other  CONCELLs,  with  the  exception  that  the  CDRF 
field  may  contain  a  null  pointer.  All  items  in  the  same  subtree  at 
any  particular  level  will  be  connected  by  the  CDRF  field  of  a  CONCELL 
with  a  00  ATOMB  field. 

The  01  value  indicates  that  the  CARF  field  contains  a 
pointer  to  an  ATOMCELL.  The  CDRF  field  may  contain  a  pointer  to  an¬ 
other  CONCELL  with  a  00  ATOMB  field  which  is  at  the  next  level  down 
in  the  tree,  or  else  contain  a  null  pointer.  The  11  value,  which  can 
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only  be  placed  In  the  ATOMB  field  during  the  initialization  phase  of 
the  dictionary  list.  Indicates  that  the  CARF  field  contains  a  pointer 
to  an  ATOMCELL  which  contains  the  name  of  a  multiple  definition  or  an 
attribute  name.  The  CDRF  field  always  contains  a  null  pointer. 

A  sample  connective  list  is  shown  below  in  figure  7. 


The  LEVEL  field  contains  a  numeric  indication  as  to  the 
level  in  the  tree  structure  of  the  particular  cell.  For  CONCELLs  that 
are  contained  in  the  multiple  definitions  list,  the  value  contained 
in  the  LEVEL  field  is  not  changed  when  that  definition  is  included 
at  other  places  in  the  structure  of  the  dictionary. 

The  use  of  the  multiple  definitions  will  be  explained 
later  during  the  discussion  of  the  initialization  of  the  dictionary. 

2.  File  Descriptions 

Two  files  are  used  by  the  program  during  the  initialization 

/ 

phase  to  enter  data  into  the  program,  the  Dictionary  file  and  the  Codes 
file.  Both  files  are  read-only  files  when  used  by  the  L1B-SER  program, 
a.  Description  of  the  Dictionary  File 
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(1)  Convention  and  Special  Characters.  With  the  excep¬ 
tion  o£  the  attribute  names  and  the  names  of  the  multiple  definitions 
sequences,  all  Items  in  the  dictionary  are  single  characters.  Spaces 
or  blanks  are  used  only  to  improve  the  readability  or  to  separate  items 
in  the  dictionary.  The  attribute  names  and  the  multiple  definition 
names  must  be  at  least  two  characters  In  length  and  the  multiple  de¬ 
finition  names  start  with  the  letter  M. 

There  are  eight  characters  used  in  this  file  which 
have  special  significance  during  the  operation  of  the  program.  These 
characters  are  the  right  and  left  parentheses,  the  greater- than  symbol 
(>),  the  less-than  symbol  (<),  dollar  sign  ($),  the  minus  sign 
(-),  the  asterisk  (*),  and  the  slash(/). 

The  six  characters ,  right  and  left  parentheses , 
greater-than  symbol,  less-than  symbol,  the  dollar  sign,  and  the  slash, 
are  used  during  the  building  of  the  list  structure  to  determine  the 
proper  connections  which  have  to  be  made.  The  left  parenthesis  is 
used  to  indicate  the  beginning  of  the  dictionary  file.  The  right  paren¬ 
thesis  Is  used  to  Indicate  the  end  of  the  dictionary  file.  The  dollar 
sign  Is  used  to  terminate  the  processing  of  the  current  input.  The 
less-than  symbol  is  used  to  indicate  that  the  items  that  follow  are 
to  be  attached  to  the  preceding  character  at  the  next  lower  level  in 
the  tree  structure.  The  greater-than  symbol  is  used  to  indicate  that 
items  at  the  current  level  in  the  subtree  have  been  processed  and  that 
the  following  items  are  not  to  be  attached  at  that  level  but  at  the 
next  higher  in  the  tree  structure.  The  slash  is  used  to  indicate  that 
the  next  item  is  the  name  of  a  multiple  definition  and  that  the  pre- 
\yiously  formed  subtree,  which  is  attached  to  that  multiple  definition 
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n/.aie,  is  to  be  inserted  at  this  point  in  the  current  subtree. 

The  two  characters,  the  minus  sign  and  the  asterisk, 
are  used  during  the  attribute  determination  phase  to  indicate  special 
conditions  or  operations .  The  minus  sign  is  used  to  indicate  that  the 
following  attribute  is  not  to  be  retained  for  the  word  thus  formed. 

This  attribute  may  or  may  not  have  been  retrieved  for  the  word  under 
consideration.  The  asterisk  is  used  to  indicate  that  a  complete  word 
has  been  placed  in  the  tree  structure  to  that  point  in  the  tree,  it 
is  also  used  as  the  attachment  point  for  any  remaining  attributes  of 
the  word  that  it  terminates . 

(2)  File  Description.  The  Dictionary  File  consists  of 
a  sequence  of  characters  contained  in  a  series  of  sixty-character 
records.  The  file  begins  with  the  left  parenthesis  and  end3  with  a 
right  parenthesis  and  a  dollar  sign  in  the  last  record.  The  first 
character  in  the  sequence  after  the  left  parenthesis  is  the  symbol 
one  (1)  which  is  used  as  an  attachment  point  for  all  of  the  multiple 
definitions.  The  remaining  characters  are  placed  into  the  file  such 
that  the  tree  structure  which  is  built  from  this  file  will  contain 
all  of  the  words  entered  into  the  dictionary  of  the  program. 

At  the  zero  level  in  the  dictionary  tree,  the  twenty- 
six  letters  of  the  Latin  alphabet  are  attached  to  the  preceding 
character  in  descending  order  according  to  their  probability  of  occurrance 
in  the  English  language  [13].  These  letters  are  used  as  the  first 
character  of  all  words  in  the  dictionary.  Each  of  the  remaining  letters 
in  a  word  is  preceded  by  the  less-than  symbol  to  indicate  that  they  are 
to  be  attached  to  the  previous  letter  in  the  word  at  the  next  lower 
level  in  the  subtree. 
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Where  there  are  attributes  that  are  common  to  a  number 
of  words  with  similar  beginnings ,  those  attributes  can  be  included  in 
the  sequence  immediately  prior  to  the  beginning  of  the  dissimilarity. 
Wherever  a  complete  word  has  been  represented,  an  asterisk  is  Included 
in  the  sequence  following  the  last  letter  of  the  word.  When  a  word 
being  described  has  a  common  ending,  such  as  the  plural  endings  ES 
and  S,  the  multiple  definition  name  for  that  ending  is  placed  in  the 
sequence  immediately  following  the  slash  symbol,  and  this  multiple 
definition  name  is  used  to  complete  the  word.  Figure  8  illustrates 
the  use  of  the  multiple  definition  name  in  the  sequence. 

S<0<(f<E<*  ADVBP »»  E<N<D  <VERBP  *  /MING»» 

FIGURE  8.  Sample  Dictionary  Record 
b.  Codes  File. 

The  codes  file  consists  of  a  series  of  sixty-character 
records  of  which  only  the  first  twenty  characters  are  utilized.  Each 
record  consists  of  the  topic  name  beginning  in  the  first  character  posi¬ 
tion  and  followed  by  the  five  number  numeric  code  word  in  the  last 
five  character  positions  with  blanks  in  between.  The  records  are  ar¬ 
ranged  in  the  file  according  to  the  hash  coded  index  of  each  item  that 
is  determined  during  the  operation  of  the  TABLE  program  and  its  call 
to  the  subroutine  KASHX  . 

3.  Initialization. 

The  program  has  an  initialization  phase  in  which  the  dictionary 
file  is  read  and  structured  into  the  tree  and  subtrees  required  for 
later  use  in  the  determination  of  the  attributes  of  a  user's  request. 
After  the  dictonary  has  been  structured,  the  table  file  is  read  into 
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an  array  for  use  during  the  search  for  the  numeric  code  words  required 
for  the  information  retrieval  system. 

a.  The  Dictionary  Tree  Structure. 

The  dictionary  tree  structure  is  a  connective  list  which 
has  pointers  in  the  CAST  field  of  the  appropriate  CONCELL  to  point 
to  the  required  symbolic  data  in  the  ATOMLIST.  It  is  formed  by  first 
constructing  the  multiple  definitions  subtree  and  then  constructing 
the  remaining  subtrees,  all  of  which  start  with  the  letters  of  the 
alphabet  and  some  punctuation  characters.  All  of  the  subtrees  are 
constructed  independently  by  the  recursiveness  of  the  structuring 
procedure  (S_EXPR)  until  the  last  subtree  has  teen  structured  at  which 
time  the  subtrees  are  connected  to  form  the  dictionary  tree.  The 
recursive  construction  of  the  tree  is  illustrated  in  figure  9  on  page  24. 

Although  the  multiple  definitions  subtree  is  constructed 
in  the  same  manner  as  all  of  the  other  subtrees  in  the  dictionary, 
there  is  a  special  pointer  (L3)  which  is  assigned  to  point  to  this 
subtree  during  the  initialization.  This  assignment  occurs  immediately 
upon  completion  of  the  structuring  of  the  subtree  and  before  the  next 
subtree  is  started.  This  pointer  is  required  during  the  structuring 
of  the  remaining  subtrees  as  a  beginning  pointer  for  the  procedure 
(LOOKLIST)  when  a  search  for  a  multiple  definition  used  in  the  other 
subtrees  is  required.  The  multiple  definitions  subtree  cannot  be  re¬ 
cursively  defined  (i.e.,  a  multiple  definition  which  uses  another  mul¬ 
tiple  definition  as  one  of  its  endings)  since  the  special  pointer  is 
not  assigned  until  the  completion  of  the  entire  structuring  of  the 
subtree. 

During  the  construction  of  the  remaining  subtrees,  the 
multiple  definitions  are  used  whenever  a  slash  is  encountered  during 
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(c)  Complete  Return 


FIGURE  9.  Recursive  Structure  of  Connected  Lists 


the  scanning  of  the  Input  symbols.  When  this  occurs,  the  procedure 
LOOKL1ST  is  invoked  to  determine  the  pointer  to  the  subtree  required 
at  this  point  for  the  completion  of  the  word  being  entered  into  the 
dictionary.  The  program  allows  more  than  one  multiple  definition  to 
be  used  as  an  ending  of  a  word  with  the  stipulations  that  none  of  the 
multiple  definition  subtrees  can  begin  with  the  same  letter  nor  can 
the  multiple  definitions  subtrees  begin  with  the  same  letter  as  is 
already  included  in  the  subtree  of  the  word  at  that  level.  The  use 
of  the  multiple  definition  is  illustrated  in  figure  10  on  pages  26 
and  27. 

b.  The  Table  Array 

The  table  file  is  copied  directly  into  the  array  area, 

CODE.  No  processing  occurs  with  the  items  in  the  file  during  the 
initialization  phase  of  the  program.  Currently,  there  is  a  logical 
switch  (true  or  false)  which  has  been  turned  off  (false)  to  prevent 
the  table  file  from  being  read  into  the  CODE  array.  When  further 
implementation  of  the  proposed  library  search  system  has  been  accom¬ 
plished,  this  switch  can  be  turned  on  to  allow  the  initialization  of 
the  table  array  for  use  in  the  table  search  procedure. 

4.  Processing  of  a  Request 

After  the  initialization  phase  has  been  completed,  the  computer 
will  loop  through  the  remaining  parts  of  the  program  until  the  string 
ST0P$  is  entered  at  the  terminal  by  the  operator. 

a.  Entry  of  a  Request 

The  user  is  requested  by  the  program  to  enter  his  name 
and  student  box  number  (or  other  appropriate  school  mail  box)  to  be 
used  for  identification  purposes.  The  user  is  then  asked  to  enter 
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his  English  language  request  surrounded  by  right  and  left  parentheses 
and  ending  with  a  dollar  sign.  If  the  request  is  too  long  to  be  en¬ 
tered  at  one  time,  the  right  parenthesis  and  dollar  sign  can  be  left 
off  until  the  entire  request  has  been  entered. 

The  request  entered  will  be  built  into  a  tree  (L2)  in  the 


same  manner  as  the  dictionary  tree  where  all  of  the  words  and  punctua¬ 
tion  are  entered  as  single  item.  Figure  11  is  an  example  of  a  typical 
user 's  request  tree. 


FIGURE  11.  Sample  User's  Request 

This  tree  structure  will  then  be  passed  to  the  attribute  determination 
routine  for  processing. 

b.  Determination  of  the  Attributes 

In  the  procedure  ATTRB,  each  word  In  the  user's  request 
is  separated  into  its  constitutent  letters  and  these  letters  are  passed 
one  at  a  time  to  the  LOOKLIST  procedure.  The  ATTRB  procedure  will  also 
pass  to  the  LOOKLIST  procedure  the  subtree,  or  tree  if  the  letter  is 
the  first  letter  of  the  word,  on  which  the  LOOKLIST  procedure  is  to 
search  for  the  letter. 

In  the  LOOKLIST  procedure,  each  letter  is  searched  for  only 
in  the  highest  level  of  the  subtree  passed  to  it.  Any  attributes  which 
are  encountered  before  the  letter  is  found  will  be  attached. to  a  tem¬ 
porary  pointer  (L3)  for  later  combination  with  the  final  attributes. 
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When  the  letter  has  been  found,  the  pointer  contained  in  the  CDRF  field 
of  the  CONCELL  which  points  to  the  ATOMCELL  containing  the  letter  is 
returned  to  the  ATTRB  procedure.  This  pointer  will  be  used  by  the 
ATTRB  procedure  as  the  pointer  to  the  subtree  used  during  the  next 
call  to  LOOKLIST. 

This  searching  will  continue  until  all  of  the  letters  of 
the  word  have  been  found.  After  the  last  letter  has  been  processed, 
the  ATTRB  procedure  will  pass  to  the  LOOKLIST  procedure  an  asterisk 
and  the  latest  subtree.  The  pointer  (Q)  returned  from  this  call  will 
point  to  the  list  of  final  attributes  of  the  word,  if  any. 

At  this  point,  both  the  temporary  attribute  list  and  the  * 
final  attribute  list  will  be  passed  to  the  DELETE  procedure  for  the 
deletion  of  any  attributes  in  the  temporary  list  which  are  indicated 
as  not  applying  to  the  word  by  the  prefixed  minus  sign  on  the  attribute. 
The  before  and  after  structure  of  the  temporary  and  final  attribute 
lists  are  illustrated  in  figure  12. 


(a)  | I  (b) 

Before  Deletion  After  Deletion 

FIGURE  12.  Temporary  and  Final  Attribute  Lists 


The  temporary  list  and  the  final  list  will  be  connected  after  this 
deletion  has  taken  place,  and  the  resulting  list  will  be  attached  to 
the  word  on  the  user’s  request  list  (L2)  as  shown  in  figure  13. 


FIGURE  13.  User's  Request  with  Attributes  Attached 

As  the  processing  of  each  word  has  been  accomplished,  a 
message  is  displayed  at  the  terminal  to  the  effect  that  the  attributes 
have  been  found.  If  during  the  search  for  the  letter  of  a  word  by  the 
L00KL1ST  procedure,  that  letter  is  not  found,  a  null  pointer  is  returned 
to  the  ATTRB  procedure.  This  null  pointer  will  result  in  the  ATTRB 
procedure  displaying  a  message  at  the  terminal  that  the  attributes  of 
the  word  have  not  been  found,  and  then  the  procedure  will  begin  to 
process  the  next  word  in  the  request. 

After  all  of  the  words  have  been  processed,  the  computer 
prints  out  the  user's  list  which  now  includes  the  attributes  of  the 
individual  words.  The  computer  then  branches  back  to  the  beginning 
and  requests  a  new  user  to  enter  his  name  and  box  number. 

B.  TABLE  PROGRAM 

Since  the  TABLE  procedure  in  the  main  program,  LIB-SER,  was  written 
to  allow  only  the  retrieval  of  the  numeric  code  words  from  the  CODE 
array,  the  auxiliary  program,  TABLE,  was  written.  The  TABLE  program 
allows  the  entry  of  an  item,  the  deletion  of  an  item,  and  the  retrieval 
of  information  contained  in  an  item  in  the  CODE  array. 


30 


The  TABLE  program  first  initializes  the  CODE  array  by  copying 
directly  into  the  array  the  individual  records  in  the  CODES  file. 

The  records  are  truncated  after  the  twentieth  character.  Once  in  the 
array,  the  program  calls  the  subroutine  HASHX  with  various  parameters 
to  perform  the  functions  of  retrieval,  deletion,  or  storage  of  an 
item  in  the  array.  When  the  required  modifications  to  the  array  have 
been  accomplished,  the  CODE  array  is  copies  back  into  the  CODES  file 
with  each  item  being  padded  with  blanks  on  the  right  to  fill  out  each 
record. 

Although  the  TABLE  program  was  written  to  be  operated  from  the 
time-sharing  terminal,  with  slight  modifications  it  could  be  run  under 
the  batch  processor  at  the  computer  installation.  This  might  be  de¬ 
sirable  if  the  number  of  modifications  is  substantial. 

C.  HASHX  SUBROUTINE 

The  HASHX  subroutine,  which  is  used  to  compute  the  address  of  a 
location  in  a  table  or  array  and  either  return  the  contents  of  that 
location  or  place  an  item  in  that  location,  is  based  on  the  work  done 
in  scatter  storage  techniques  by  W.  D.  MAURER  [14].  The  method  used 
here  is  referred  to  as  a  "division  hash  code"  to  distinguish  it  from 
the  logical  or  multiplicative  methods  most  often  used  in  scatter 
storage  systems  [15]. 

Hash  coding  techniques  can  be  applied  to  any  table  or  array  in 
which  the  access  is  _o  be  made  to  the  entries  in  an  unpredictable 
order  and  the  items  are  identified  by  some  key  or  name  associated  with 
their  contents.  In  the  hash  coding  technique,  a  transformation  is 
performed  on  the  key  to  produce  an  address  in  the  table  where  the  key 
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and  the  entry  associated  with  that  key  can  be  placed.  A  good  trans¬ 
formation  is  one  which  will  distribute  these  addresses  uniformly 
across  the  available  table  area. 

In  most  hash  coding  techniques,  the  table  size  is  restricted  to 

being  a  power  of  two,  since  the  common  method  of  obtaining  the  address 

is  to  calculate  a  k-bit  field  which  is  assumed  to  be  a  random  integer 

* 

between  zero  and  2^-1,  and  this  integer  is  then  used  as  the  address. 

In  the  method  proposed  by  MAUSER  and  used  in  this  subroutine,  the 
table  size  is  a  prime  number,  and  therefore  it  can  be  almost  any  size 
desired. 

1.  Determination  of  the  Hash  Code 

The  hash  coding  method  used  here  consists  of  using  the  first 
eight  characters  of  the  topic  name  as  the  key.  The  first  four  charac¬ 
ters  are  exclusively  OR-ed  with  the  last  four  characters  to  obtain  one 
full  computer  word  (32  bits  on  the  IBM  360/67  computer) ,  and  this 
computer  word  is  divided,  using  integer  arithmetic,  by  the  table  size. 
The  remainder  from  this  division,  which  is  a  number  between  zero  and 
one-less-than-the-table-size,  is  used  as  the  hash  code. 

This  method  of  calculating  the  hash  code  is.  comparable  in  speed 
to  the  multiplicative  and  logical  methods  with  the  advantage  that  is 
almost  completely  free  of  the  nonrandomness  which  often  occurs  in  the 
other  methods. 

In  using  this  method  of  hash  coding,  a  change  in  the  table 
size  will  change  all  of  the  hash  codes  for  the  table  entries.  If  the 
table  size  has  to  be  changed,  all  of  the  hash  codes  for  the  entries 
must  be  recomputed.  By  using  a  prime  number  which  is  twenty-five 
percent  larger  than  the  maximum  number  of  expected  entries  for  the 
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table,  the  average  number  of  steps  required  to  find  an  item  in  the 
table  is  less  than  two,  i.e.,  the  original  hash  code  address  and 
one  collision  modification. 

2.  Collision  Handling 

In  this  method,  as  in  all  other  hash  coding  methods,  when  two 
keys  have  the  same  hash  code,  a  "collision"  is  said  to  have  occurred. 
When  a  collision  occurs,  the  item  causing  the  collision  must  be  located 
out  of  place  in  the  table. 

There  are  a  number  of  methods  for  determining  a  location  for 
this  item,  such  as  searching  linearly  forward  in  the  table.  Another 
approach  is  to  add  a  random  number  to  the  collision  hash  code  to  de¬ 
termine  a  new  location.  Neither  of  these  methods  is  very  satisfactory 
since  they  are  intrinsically  slow  if  there  is  any  sort  of  clustering 
of  the  items  In  the  table  (many  items  with  the  same  hash  code  place 
next  to  each  other) . 

The  method  of  handling  collisions  used  in  this  subroutine  is 
the  "quadratic  search".  In  this  method,  a  quadratic  equation  is  used 
to  calculate  the  new  location.  The  hash  code  which  caused  the  colli¬ 
sion  is  used  as  the  constant  term  in  the  equation  and  the  other  coef¬ 
ficients  depend  upon  the  table  size.  The  quadratic  search  was  shown 
by  MAURER  to  always  search  for  the  item  or  an  empty  location  by  looking 
at  exactly  half  of  the  remaining  table  locations  if  the  table  size  is 
a  prime  number.  In  using  this  method  of  handling  collisions,  the  table 
is  declared  full  when  the  quadratic  search  has  searched  through  half 
of  the  table. 
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III.  AN  EXAMPLE 


To  demonstrate  the  operation  of  the  LIB-SER  program,  only  a  repre¬ 
sentative  dictionary  is  utilized.  For  a  total  implementation  of  this 
system,  the  dictionary  would  have  to  be  greatly  expanded.  The  dictionary 
used  for  this  demonstration  contains  seventy  words . 

The  program  is  loaded  for  operation  at  the  time-sharing  terminal 
by  entering  LDRM.  LDRM  is  a  time-sharing  executive  file  which  causes 
the  program  and  the  system  library  to  be  loaded  into  the  computer  and 
starts  the  execution  of  the  program.  If  a  non-blank  character  had 
been  entered  after  LDRM,  the  program  would  request  the  operator  to 
enter  optional  parameters  for  use  by  the  program.  These  parameters  are 
logical  switches  which  can  be  used  to  display  traces  through  the  program 
or  selectively  enable/disable  the  various  routines  in  the  program. 

After  execution  has  begun,  and  any  optional  parameters  entered, 
the  program  initializes  the  dictionary  tree  structure.  The  computer 
will  then  ask  a  user  to  enter  his  request,  which  will  then  be  processed 
to  obtain  the  attributes  of  the  individual  words.  Figure  15  illustrates 
a  complete  run  for  an  individual  user. 

The  difference  between  the  attributes  for  a  word  which  has  been 
processed  with  the  DELETE  procedure  disabled  and  the  same  word  when  it 
has  been  enabled  is  illustrated  below  in  figure  14. 

The  computer  will  then  loop  back  to  ask  a  new  user  to  enter 
his  request.  The  program  can  be  stopped  by  entering  STOP$. 

(INFORM<NOUNT  -NOUNP  VERBP> )  (INFORM<VERBP>  ) 

(a)  (b) 

DELTE  DISABLED  DELETE  ENABLED 

FIGURE  14.  Effects  of  DELETE  Routine 
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ENTER  NAME  AND  SMC  NUMBER 
_c.t.  schmidt  box  1962 

ENTER  REQUEST,  SURROUND  WITH  (  )$ 

_(vhat  do  you  have  on  computers ?)$ 

WHAT 

DO 

YOU 

HAVE 

ON 

COMPUTERS 

? 

****************************** 

C.T.  SCHMIDT  BOX  1962 
*********** ************* ****** 


ATTRIBUTES  FOUND  FOR 
ATTRIBUTES  FOUND  FOR 
ATTRIBUTES  FOUND  FOR 
ATTRIBUTES  FOUND  FOR 
ATTRIBUTES  FOUND  FOR 
ATTRIBUTES  FOUND  FOR 
ATTRIBUTES  FOUND  FOR 


WHAT 

DO 

YOU 

HAVE 

ON 

COMPUTERS 

? 


(WHAT (INTER  ADVBP>  DO  (vERBP)  YOU<PRNOUNP>  HAVE  <VERBP  PRN3SG> 
ON<PREPP>  COMPUTERS  OJOUNP  PLUR>  ?  <PUNTI>  ) 


**  AT  THIS  POINT  YOUR  REQUEST  WOULD  BE  PASSED  TO  THE 
TRANSLATION  AND  TABLE  SEARCH  ROUTINES.  ** 


FIGURE  15.  Sample  Program 
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IV.  CONCLUDING  REMARKS 

This  paper  has  discussed  a  dictionary  list  structure  which  could 
be  used  by  an  English  Language  translator  to  enable  natural  language 
man-machine  conversation  directed  towards  the  retrieval  of  information 
from  a  data  bank.  The  development  of  the  English  Language  translator 
utilizing  this  dictionary  structure  and  the  inclusion  of  this  translator 
into  the  existing  information  retrieval  system  used  by  the  Technical 
Reports  Section  at  the  Naval  Postgraduate  School  Library  would  provide 
the  school  a  flexible  and  accessible  information  system  for  use  in 
obtaining  a  bibliography  of  the  reports  held  by  the  library. 

A  translation  procedure  which  is  based  on  a  generative  model  of 
syntax  and  semantics  that  is  comprehensive  enough  to  automatically 
resolve  some  forms  of  syntactic  and  semantic  ambiguities  would  be  re¬ 
quired  for  this  system.  The  translator  would  have  to  be  able  to 
syntactically  parse  the  user's  request,  using  the  attributes  determined 
from  a  dictionary  structure  such  as  described  in  this  paper,  and  then 
use  this  parse  in  conjunction  with  the  semantic  environment  of  the 
system  to  determine  the  topic  areas  of  interest  from  the  user's  request. 
In  the  accomplishment  of  this  goal,  it  should  have  some  means  of  com¬ 
munication  with  the  user  to  resolve  any  ambiguities  which  it  is  not 
otherwise  able  to  resolve.  Once  the  topic  areas  of  interest  have  been 
determined,  the  system  can  readily  produce  the  numeric  code  words 
required  to  obtain  a  bibliography  from  the  existing  Information 
retrieval  system. 
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/************************************************** 

*************************************************** 


**  ** 

**  LIBRARY  SEARCH  SYSTEM  ** 

**  ** 

**  LIB  SER  PROGRAM  ** 

**  ** 


*************************************************** 

**************************************************/ 


LIB  SER:  PROC(PARMS)  OPT! on$ { MAT N) : 

DCL  CODES  FILE  STREAM  ENVIRONMENT  («=(BOn, 

DICT  FILE  STREAM  ENVIRONMENT  C  *=  ( SO  ) )  t 
DCL  PAPMS  CHAR(8IVAR  ,CQDE(11)  CHAP(20): 

OCL  <ATOML!ST,P,NIL)RTP,rsiZE  FIXED  BIN, 

/************************************************** 

*  DECLARATION  oc  THE  FIELDS  OF  THE  CELLS  ON  * 

*  THE  'ATOMLIST*  LIST.  * 

**************************************************/ 

1  ATOMCELL  BASED! P) , 

2  SIZE  FIXED  BIN, 

2  NEXT  PTR  , 

2  PNAME  CHAR ( CSI ZE  REfpR( SI ZE) 1 , 

/************************************************** 

*  DECLARATION  OF  the  FIELDS  OF  THE  CELLS  ON  * 

*  LI,  L?,  AND  L3  LTSTS.  * 

♦***♦***♦♦♦*** ************ ************************/ 


1 


CONCELL  BASED! P) , 

2  ATOMP  B I T( 2 ) , 

2  LEVEL  FIXED  RI N 
2  CAPF  PTR, 

2  CORF  PTR i 


» 


/********************* ***************************** 

*  DECLARATION  oc  A  FT LE  CONTROL  BLOCK  USED  * 

*  DURING  THE  READING  AND  WRITING  OF  THE  * 

*  FILES  USED.  * 

**************************************************/ 


DCL  1  FCR  STATIC, 

2  COMMAND  CHAR! B) , 

2  FILENAME  CHAR ! R }  I  NT T ! ' DI CT ' ) , 

2  FIIETYPE  CHAP  (  R I  I NI T ( 'DATA ' )  , 

2  CAP DNUM  FIXED  BIN, 

2  STATUS  FIXED  BIN  IMTT(O), 

2  CAPD_BUFF  CHAP ! 901 : 

/************************************************** 

*  PROCEDURE  WHICH  IS  USED  to  INITIALIZE  THE  * 

*  DICTIONARY  AMD  THE  TAPLF  ARRAY.  * 

**************************************************/ 

DCL  IN  I  T  fmtpYJ 
INIT:  PROC: 

DCL  I  fixed  PIN: 

ATOMLI ST ,NI L=NULL : 

FLO  =  '  1  *P  : 

COMMAND  =  «onnuF»: 

DISPLAY! 'INITIALIZING  FROM  FUE'J: 

LI  =  READS: 

COMMAND  =  'FINIS': 

CALL  IHEFILE<FCRJ : 

FLO  =  'O'P : 
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IF  TP  I  THEN  CALL!  PR  I  NTS (COR ( LI ) ) : 

IF  TBL  THEN  DO:  ! 

COMMANO  =  'RDBUF': 

FILENAME  =  ’Cones'  : 

DO  I  *  l  TO  trlSI  7  = : 

CARDNUM  =  |T  : 

CALL  IHEF ILE (FCB) : 

COHEd  )  =  'C AR 0_RUpc  : 

IF  TRI  THEN  DISPLAY! 1  CnDE  ( '  1  !  T  t  •  )  »M 
CODE!!  Y)  :  F NO  : 

COMMAND  =  'PINTS*  : 

CALL  IHEFILE(f=CBJ  :  END: 

DISPLAY ( '  CND  OF  INITIALIZATION*) : 

ENO  init: 

/************************************************** 

*  PROCEDURE  WHICH  IS  USED  TO  ENTER  ITEMS  * 

*  INTO  THE  PROGRAM.  * 

*************************************** <***********/ 

i 

DCL  INPUT  ENTRY( CHAR ( 72) ) ? 

INPUT:  PROC(P): 

DCL  B  CHAR ( 72 ) , ( S  CHAR{8),I  FI XeD  BTN  I NIT (0 ) ) ST  AT IC 
IF  FLD  THEN  DO: 

I  *  1*1 : 

CARDNUM  =1: 

CALL  IHEFILE(FCR) : 

8  *  CARD  RUFF : 

IF  tr i  THEN  DT SPLAY! 'BUFFER  *  I ! 8) S  END: 

ELSE  DISPLAY! 'ENTER  STRING,  SURROUND  HUH  (  )*') 
REPLY! B ) :  [ 

END  INPUT: 

/************************************************** 

*  PROCEDURE  WHICH  IS  USED  TO  PRINT  OUT  ITEMS  * 

*  FROM  THE  PROGRAM.  * 

**************************************************/ 

DCL  OUTPUT  FNTRY! CHAR ( *)  )  ; 

OUTPUT:  PROC(C): 

DCL  C  CHAR!*), I  FIXED  BIN  I  NT T (0) ST ATIC : 

IF  FLD  THEN  DO: 

I  =  1+1 : 

CARDNUM  =  I: 

COMMAND  =  'WRRUP'; 

CALL  IHEFTLE(FCB) :  END: 

ELSF  D I  SPLAY! C )  ; 

END  OUTPUT: 

/************************************************** 

*  PROCEDURE  USP  D  TO  DETERMINE  THE  FUNCT I  ON  * 

*  OF  THE  CELL  SPECIFIED  BY  THE  POINTER.  * 

**************************************************/ 

DCL  ATOM  ENTRY(PTR)  RETURNS! BIT! 2) ) : 

ATOM:  PROC (A)  BIT(2) : 

IF  A  =  NI  L  THEN  RE TUR N! * 01 » B)  : 

RETURN! A->ATOMS) : 

END  ATOM: 

/************************************************** 


*  PROCEDURE  WHICH  CONSTRUCTS  A  CHARACTER  * 

*  POINTER  CELL  (A  CELL  WITH  EITHER  A  11  OR  01  * 

*  ATOMB  FIELD)  AMD  INSERTS  THC  POINTER  * 

*  TO  THE  CHARACTERS)  ON  the  'ATOmlIST'  LIST  * 

*  IF  ALREADY  ON  the  LIST  OR  ELSE  PLACES  * 

*  THE  CHARACTER  ( S )  ON  THF  'ATHEIST'  LIpT  * 

*  AND  INSERTS  THE  POT  NTER  TO  this  NEW  CELL.  * 


****************** ******************** ************/ 

OCL  STRING  ENTRY!CHAR (*) VAR)  RETURNS (PTR ) : 

STRING:  PRCC(C)  °TR : 


i 


DCL  C  CHAR ( * J 
IF  TRC  THEN  D 
0  *  LOOKUO(C) . 

ALLOCATE  CONCELL  SET(R)  : 

L  ■  LENGTH(C)  ; 

IF  L>1  THEN 

IF  FLO  THFN  ATOMR  =  'll'R: 

EL  Sp  ATOMS  =  *01  *B: 

ELSE  ATQMB  *  *01 »R: 

corf  =  nil: 

LEVEL  =  LEV: 

Ip  0  =  NIL  THEN  00: 

CSIZE  *  L: 

ALLOCATE  ATOMCELL  SeT(OJ: 

Q->NE  XT  =  ATOMLI ST: 

0->PNAME  =  C: 

ATOMLIST  =  0:  eno  : 

CARF  =  0: 

return:  p  ) : 

END  STRING: 

■ /************************************************** 

*  PROCEDUPE  WHICH  SEARCHES  THROUGH  THE  * 

*  ' ATCMLI ST  *  LIST  TO  FI  MO  THE  CELL  LOCATION  * 

*  OF  THE  INPUT  CH AR ACTER <  S)  .  * 

*********************** ****** ********* ************/ 

OCL  LOOKUP  ENTRY( CHAR  I  *  J  VA°)  RETURNS (PTR) : 

LOOKUP:  PROC(C)  PTR: 

DCL  C  CHAR ( *) VA  R  » 0  PTR  STATIC: 

0  =  ATOMLIST: 

DO  WHILEf  Q-.=N!L)  : 

IF  0->PNAME  =  C  THEN  RETURN?  0) : 

o  =  q->next:  end: 

RETUR  N(  NIL): 

END  LOOKUP: 

/************^ ************************************* 


*  PROCEOURp  WHITH  SCAMS  THE  INPUT  qtjFFEP  * 

*  AREA  AND  PETURNS  VALID  CHARACTER  STRINGS  * 

*  WITHOUT  BLANKS  TO  tHE  CALLING  POINT.  THE  * 

*  VALID  CHARACTER  STRINGS  ARE  SINGLE  SPECIAL  * 

*  (EXCEPT  AND  ALPANUM=RIC  STRINGS  * 

*  INCLUDING  THE  CHARACTER.  * 


**************************************************/ 

DCL  SCAN  RETURNSICHAR I 80IVAR) ; 

SCAN:  PROC  CHAR (  80) VA® ; 

OCL  ( ( BP  INIT(72I  ,B)FTXED  B!N,!T  CHAP(1J, 

BUFF  CHAR( 72) ) ) STATIC: 

B  =  0 « 

DO  *WHILE(B=0)  : 

IF  BP  >=  72  THEN  HP; 

BP  =  0:  CALL  INRUT(BUFF):  END: 

3P  =  +  i • 

IF  SUBSTRfBUPPjBP*!)-'3'  •  THFN  B  =  BP:  ENO: 
T=SUBSTR ( BUCC  »B  »1 5  : 

IP  T  <  '  A  *  S  T  -t=  *-»  THEN  RCTUPN(T): 

DO  WHILE?  RP<72  t  t  '  i  it  *•-'  I  T  >=*A*II: 
BP  =  BP+1 :  T=SURSTR ( PUFF ,BP,1 ) :  END: 

BP=RP-1 : 

RETURN:  SURSTR(BURF.B,eP-R+m  : 

END  SCAN: 


VAR , (0  PTR.L  PIXP 
I  SDLAYC  STRING  M 


PD  BIN)  STATIC:  ■■ 

MCI  I*  AT  LEVEL'  I  ILFV) 


/************************************************** 


*  PROCEOURp  WHICH  CONSTRUCTS  A  CONNECTIVE  * 

*  CELL  (A  CEIL  WITH  a  OO  ATOMR  PIELD)  * 

*  WITH  THE  INRUT  POINTERS  AND  THE  * 

*  CURRENT  LEVEL.  * 


************************************************** / 


DCL  CONS  ENTRY!PTR,PTR)  RETURNS: PTR) : 
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CONS:  PROC!A,BJ  ptp : 

DCL  (A.B)PTR: 

ALLOCATE  CONCELL  SET(P) 
ATOMS*' 00 'B: 

LEVEL  *  LEV: 

CAR  F=A : 

CDRF*B: 

RFTURN(P): 

ENO  CONS: 


/*****************************  ************ ********* 

*  PROCEDURE  WHICH  RETUP  NS  THF  POINTER  * 

*  CONTAINED  IN  the  CARE  FIFLD  OF  THF  * 

*  SPECIFIED  CELL.  * 

***********  *******************  ********«************/ 


DCL  CAR  ENTRY! PTR  I  PETURNS ( PTR ) : 

CAR:  PROC(A)  PTR; 

DCL  A  PTR; 

IF  A=NT  L  THEN  RE tur N< NI L)  : 

IP  A->ATOMB  THEN  RE  TURN  ( NI L ) : 

RETURN ( A- >CAR F )  ; 

END  CAR ; 

/***************  *********************************** 

*  PPOCEDUPc  WHICH  RETURNS  THE  POINTER  * 

*  CONTAINED  IN  THE  CORF  FIELD  OF  THF  * 

*  SPECIFIED  CELL.  * 

**************************************************/ 


DCL  COR  ENTRY! PTR J  RETURNS ! PTR ) : 

CDR:  PROC ( A )  PTR; 

DCL  A  PTR; 

TF  A=NI L  THEN  RETURN! NI L) ; 

RETURN! A->CDRP) ; 

END  CDR ; 

/************************************************** 


*  PROCEDURE  WHICH  RETURNS  THE  POINTER  * 

*  CONTAINED  IN  THE  CARP  FIELD  OF  THE  CFLL  * 

*  POINTED  TO  BY  THE  CARF  FIELD  OF  THE  * 

*  SPECIFIED  CELL.  * 


**************************************************/ 

DCL  CAAR  FNTRYtPTR)  RETURNSIPTR) ; 

CAAR:  PROC ( A )  PTR ; 

DCL  A  PTR; 

IF  A  *  NIL  THEN  PETURN(NIL): 

A  =  A-XARF;  , 

RETURN!  A-XARF)  ; 

END  CAAR;  | 

■ /*****************************  ********************* 

*  PROCEDURE  WHICH  WILL  PRINT  OUT  ALL  np  the  * 

*  INDIVIDUAL  CHARACTERS  ON  THE  LIST  SPECIFIED.  * 
****************************** ********************/ 

DCL  PRINT  ENTRY! PTR  J  ;  P 

PRINT:  PROC ( A )  RECURSIVE: 

DCL  A  PTR; 

IF  A-«=N I L  THF N 

IF  A->ATOMB  THEN  CALL  OUTPUT ( SYMRQL ! A)) : 

ELSF  00: 

CALL  PRINT! ( A->CARPJ» : 

CALL  PRINT!  (A-XDRF) )  ;  FND : 

END  PRINT; 


/** ********* ******************* ********************* 

*  PROCEDURE  WHICH  RETURNS  THF  CHARACTERS  * 

*  CONTAINED  IN  TH c  PNA*E  FIELD  nF  THP  * 

*  ATOMCELL  SPECIFIED  * 

********* ******** ********************************** f 
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DCL  SYMBOL  FNTRY(PTR)  RETURNS!  OH  AR  ( PO)  VAR )  : 

SYMBOLS  PR  PC ( X )  CH  AR ( 8  C) VAR  S 
OCL  (x.yiptr: 

1  c  X=NI  L  THEN  RE TUP N< * N! L * )  : 

Y=X->CAR<=; 

RETURN! Y->PNAME) J 
END  SYMBOL: 

/**********************  *************  *************** 

*  PROCEDURE  WHICH  IS  USED  TO  CALL  THC  * 

*  PROCEDURE  TO  FORM  the  LIST  STRUCTURE  IT  * 

*  IS  ALSO  USED  to  ESTABLISH  A  POINTER  TO  THE  * 

*  MULTIPLE  DEFINITIONS  LIST  DURING  THE  * 

*  THE  INITIALIZATION  RHAS*.  f  * 

***********************************************♦**/ 

DCL  READSS  ENTRY  RETURNS ( °TR ) ; 

READSS:  PROP  PTR  RECURSIVE: 

OCL  UO,P)PTR»CK  FIXED  «IN  I  NIT  (0 ))  ST  AT  IC  • 

ERR  =  • O'B :  CK  =  CK+l  : 

0  =  S  FXPR<  'O'R) : 

IF  ERST  THEN  CALL  OUTRUTpILL  FORMFD  EXPRESSION'): 
IF  CK  =  1  THEN  DO: 

PP,L3[  =  0: 

Ic  TR|I  THEN  DP: 

P  ),=  CAAR  ( (  L? ) )  : 

DISPLAY! 'FIRST  ITEM  ON  MULTIPLE  DEFINITION  • 
I T 'Ll  ST  • | I P->PNAME) :  END: 

CK  =  CK+l:  end: 

ERR  =  »C'B:  CK  a  CK-1  : 

R ETUR N!  0  )  : 

END  READSS: 

/********  ***************************************** * 


*  PROCEDURc  WHICH  IS  USED  TO  TALL  THE  LEVEL  * 

*  ZFPC  PROCEDURE  WHICH  BUILDS  THC  LIST  * 

*  STRUCTURE  AND  CONTROLS  the  DURATION  0*  * 

*  THE  BUILDING.  * 


***********  ***************************************  / 

OCL  READS  ENTRY  RETURNS!  PTR)  : 

READS:  PROP  PTR  RECURSIVE: 

DCL  0  PTR  STATIC:  I 

ERR  =  '0'<B: 

0  =  S_EXPP ! *  1  •  B )  : 

IF  ERfT  THEN  CALL  OUTPUTCILL  FORMED  EXPRESSION *): 

DO  WHILE(SCAN-*=*  *»)  :  END: 

FRP  =  'O'B: 

RETURN(O); 

END  PEAOS: 

/********L* ********  *********  *********************** 

*  PROCEDURE  WHICH  BUILDS  th  =  'LISP  LIKE'  * 

*  STRUCTURE  OF  the  LISTS  USED  BY  THE  PROGRAM.  * 
********  ******************t  ************************  ^ 

OCL  S  EXPR  FNTRY(BITfl))  RE^pr^ Ptr) . 

S  EXPR:  PR OC ! D0T)  dtr  RECURSIVE:  , 

DCL  DOT  BIT(l) ,(0,R)dtr, (Tf SYM)rHAR(BO)VAR  STATIC: 
DCL  PIGHTSIDE  RFTUR NS ( PTR )  : 

RIGHTSTDE:  PR  OC  PTR: 

DCL  P  PTR: 

IF  DOT  THFN  PFTURN(Q) : 
r  =  SwFXPR( 'O'B)  ; 

IF  FRR  THEN  RFTURN(NTL): 

LEV  =  0->LE VP  L : 

P  =  0->C AR  R : 

IF  TRC  THEN  01 SP L A Y ( ' LE F T  CON 
J |p_>dnamf  ft'  AT  LEVEL’  I  I 
IP  R  -,=  NT  L  then 

IF  R->LE VE  L  =  LEV+1  THEN  DO: 


NECTTVE  SYMBOL  » 
L  EV )  ; 
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0->CDRF  *  R: 

R  »  S  FXPR  (  '  O'  R)  :  END; 

RETURN(CONSTO,PH  : 

END  RIGHTSIDE: 

t*scan; 

IF  T=M  •  THEN  DO: 

Q«  S_E  XPR ( ‘O'R! : 

IF  Frp  then  RETURN(NI l) : 

RETURN! R TGHTST DE ) :  END: 

IF  T  =  '<«  THEN  DO: 

LEV  =  LEV  +1: 

RE  TURN! P  EADSS ) :  END: 

IF  T  THEN  DO: 

IF  DOT  THEN  ERR  =  'l ' B: 

ELSE  RETU°N( NI  LJ  :  END; 

IF  T= ' ) '  THEN  DO: 

IP  DOT  thfn  ERR  =  «1 'B : 

ELSE  RF TUR N( NT L)  :  END: 

IF  T  *  •/*  THEN  DO: 

SYM  *  SCAN; 

L3  «  PP: 

ENC  ='1*B: 

0  *  CAR ( LOOK LI ST !  L3  ♦  SYM) ) ; 

ENC  *  *0'B : 

R  a  S  E  XPR  (  1 0 '  B!  : 

RETURN! CONS! Q,PI ) :  END: 

ELSE  DO: 

IF  T  =  'NIL*  THEN  0  =  NIL*. 

ELSE  0  =  STRING (TI  : 

RETUR N! R IGHTSI DE J  :  END: 

RETURN! NILI: 

END  S_EXPR: 

/************************************************** 

*  PROCEDURE  WHICH  IS  USED  TO  CONVERT  THE  ITEM*  * 

*  IN  THE  LIST  STRUCTURE  INTO  A  FORMAT  CAPABLE  * 

*  OF  BEING  PRINTED  OUT.  * 

*******************************************♦******/ 


DCL  BUILD  ENTRY(PTR,RIT!1) )  RETURNS ICHAR !1000)VAR ): 
BUILD!  PROC ! PtB)  CHAR ! IOOOI VAR  RECURSIVE: 

DCL  P  PTP.B  B I T < 1 ) : 

IF  P=NI L  THEN  RETUR N! * NI L •) : 

IF  ATOM! P )  THEN  DO: 


IF  P  -i=NTL  S  P->CDPF  -»  =  NT  L  TH?N  RETURN  (SY  MBOL  (  P I 
J | '<« | (BUILD! P->CDRF  *  '  1 ' B) T I • 1 5 
ELSE  RETURN!  SYMBOL!P)  )  :  END: 

IF  ATOMICOR(PJ)  THEN  OO: 

IF  CDP  !  P )  =  NIL  TH«=N 

IF  B  THEN  PETURN(BUTLD(CAR!P) , '0«R)) : 

ELSE  RETURN! •!»! j BUI LD( C AP ( P) , » 1 »  B ) J I *)');  ENI 

ft  R  TUCM  OCTIIBW1BI1TI  nfrjBIPI  I  I  t  Ml 


ELSE  RETURN  (  *(  *  f  i  r'ui  ll;\ l  ar  \  • 

IF  B  THEN  RETURN(RUILD!CAR(PJ t'O’B) I  I 
BUILD(CDR!P) » *  I  *  B 1 > : 

RETURN!  •(  ' IIBUILD(P,»1 • B1  I  f  »I  •)  : 

END  BUILD! 


/************************************************** 

*  PPOCEDURP  WHICH  IS  USED  TO  CALL  THC  * 

*  PROCEDURE  WHICH  IS  USET  'rO  PRTNT  OUT  THE  * 

*  ITEMS  STRUCTURED  BY  THE  BUILD  PROCEDURE.  * 

******** *********** ******* **********************♦*/ 


OCL  PRINTS  ENTRY ( PTR ) ; 

PRINTS:  PROC! A ) : 

DCL  A  PTR,B  CHAR(1000IVAR,(L.I)STATTC  FIXED  BIN 
DISPLAY! ’FORMATING  LIST  FOR  PRINTING'!: 

IF  TRI  THEN  DI SP L AY < « 01 CT IONA RY  PRINTING  TAKES 
j  I  'A  LONG  timc  ')  ; 

B  =  BUILD!A,'0'B) : 

L  =  LENGTH! B) : 


DO  1=1  TO  L  BY  60: 

CALL  OUTPUT! SUBSTR!B, I ,MIN(60tL-T+l) )) :  EMO: 
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IP  FID  THEN  CALL  OUTPUT (  •  $  » ) ; 

END  PP  INTS: 

/************************************************** 

*  PROCFOURC  WHICH  TS  USED  TO  SFARCH  DOWN  A  * 

*  SPECIFIED  LIST  TO  F  T  ND  THE  CHARACTER (SI  * 

*  SPECIFIED  BY  the  PARAMETER.  W  POINTER  TO  * 

♦THE  NEXT  ITEM  *DOWN»  THE  LI «T  TS  RETUPMFn.  * 
********  ****************************************** ^ 

LOOK 1 1  pSfi’o^^C ! TtJ ?H#" «*>''«" 1  returns  (ptr,: 

DCL  ( P 1 0 » R I PTR ,c  CHAR (*) VAR* 

00  WHILE  (  P->=NI L )  : 

0  =  P->CAPF; 

R  =  CAAR((D))  j 

IF  R->pNAME  =  C  THEN  RETURN (Q-XDR F)  : 

ELSE  IF  -.ENC  THEN 

IF  Q->ATOMR  =  *  1 1 • Q  THFN 
IF  L3  »  NT  L  THEN  DH; 

L  3  =  C  ONS ( 0  *  NI L ) : 

.13->LEVEL  =  l :  end: 

ELSE  DO: 

L3  =  CONS( Of L3)  : 

„  L3->LE VEL  =  l:  END: 

rfturN?Nu?-:X0PC!  bp  *  P!  END! 

END  LOOKLIST: 

/***************  ^^^^s*************************.^ 

*  PROCEDURE  WHICH  IS  USED  TO  ATTACH  TO  ALL  * 

*  IJEMS  ON  THE  1 L  2  ’  LIST  (THE  USER  DFFINED  * 

*  LIST)  ANY  ATTRIBUTES  OF  THAT  ITEM  FOUND  ON  * 

*  THE  «L1«  (DICTIONARY)  LIST.  * 

♦^i*******************************************^^^ 

DCL  ATTR8  ENTRY: 

ATTPB:  PROC: 

0CLriP;2:S;?IT,PTR’C  CHAR(20)VARtL  FIXED  *IN, 

Cl  Char (  i  ) : 

R  *  L2 :  L3  *  NIL: 

DO  WHILE  (R  -«  =  NT  L)  : 
p  =  Ll: 

S  =  R-XARFj 
T=CAAR((R))J 

c  =  t->pname: 

L  -  T->SI  ZE  : 

DO  I  *  1  TO  L: 

Cl  =  SUBSTR  ( C  t T  »1 )  : 

0  =  LOOKLT  ST( °»C1 ) : 

IF  0  =  NT  L  THEN  DO: 

DI SPLAY{ *  A TTRI BUTE  NOT  FOUND  FOR  **  MIC): 
GO  to  LB! :  END : 

ELSE  P  =  0:  END: 

0  =  LOOKLI  ST  ( P  »  •  *  * )  ;  _ _ __ _ _ 

0  =  DELE  TE ( 0) : 

IF  0  =  NIL  then  S-XDRF  =  L* : 

ELSE  DO: 

IF  L 3  is  NIL  THEN  DO: 

S-XORF  =  1.3; 

00  WHI  l  E  (  L3-XDRF-,=  NTL)  : 

L 3  *  L3-XDPF;  FMD: 

L3-XDfc  =  g.  end : 

ELSE  s-vnpc  =  g.  ENn: 

01 SPLAY( 'ATTC r butES  FOUND  =0B  MIC): 

=  p->^DRF;  L3  =  NIL;  END: 

END  attrb: 

/*********  •  ***** *********** /  *********************** 

*  PROCEDURE  WHICH  IS  USED  TO  OcLcte  crqm  the  * 

*  LIST  Qc  Attr]RUTES  THOSF  ATTPIBUTF$  WHICH  * 

*  WERE  PICKED  UP  DURING  THE  SE'.RCH  WHICH  "0  * 


FOUND  FOR  **  MIC): 


A3 


*  NOT  APPLY  TO  THE  RPFCIFTC  ITPY.  * 

**************************************************' 

OCL  DELE  TE  ENfRY(PTP)  RETURNS! PTR) : 

oel^!«?:§'S?^o,ptp,c  chmkoiv**. 

BO  =  0; 

DO  WHILE! Q  -=_NT  L>  : 

IF~SUBSTrI  S->PNAME  ,1  ,1)  =  THEN  DO: 

BO  =  O-OCORP: 

C  *  SUBSTRl S->PNAME,2) : 

PR2  L3  * 

*  DO  WHILP  (P  -.=  NI L)  ! 

T  *  CAAP((R>): 

I F  t->PNAmE  =  C  THEN  DO: 

T  *  R->CDRP*. 

TP  P  =  R  THEN  LB  =  T; 

ELSE  P->CDRF  =  T*. 

GO  TO  LB2  :  END: 

ELS*  R  =  R->CDPF:  END: 

LB 2:  0  =  BO:  END: 

ELSE  PETURN(BO) :  END: 

RETURN(BO) : 

END  DELETE: 

/♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦A********************************** 

*  PROCEDURE  c OR  THE  ON  LINE  ENT RY  OF  ITEMS  * 

*  INTO  THE  DICTIONARY  LIST  STRUCTURE. 
**************************************************/ 

CCL  DEFINE  ENTRY: 

DEFINE:  PROC  RECURSIVE: 

DISPLAY^ENTPR  *WDRO  FOR  DICTIONARY  ENTRY') 

REPLY(IPT) : 

L  =  LENGTH(IPT) : 

BP,L2  »  LI* 

DO  I  *  1  TO  L: 

Cl  *  SUBSTRdOT.I  .1)  : 

L 3  ■  LOOKLIST!L2,C1) : 

IF  I  =1  THEN  PP  *  L3: 

,F  )?  ,VI\FW.8nf»T  ,  su.sTsnpT,i,n; 

DiIpLAyI  'LETTERS55  ALREiDY1  In'dICTIONARY  •  I  I  IPT) 
CALL  PRINTS ( PP)  * 

DISPLAY! 'PNTER  REMAINDEP  Oc  WORD  AND  • 

I T 'ITS  ATTRIBUTES  IN  DICTIONARY  form.); 
BP->CDRF  *  READS:  END: 

ELSE  RP * L2  *  L3:  END: 

CALL  PP INT$( PP) t 

DISPLAY! 'DICTIONARY  COMPLETE  =  0»  ELSE  !') 

REPLY! I PT) : 

FLD  =  IPT:  IPT  =  "  : 

IF  FLD  THEN  CALL  DEFINE: 

END  DEFINE: 

/********** *************** ************************* 

*  PROCFDURE  WHICH  IS  USED  TO  CALL  THE  HASHX  * 

*  SUBROUTI NE  TO  RETRIEVE  THE  NUMERIC  CpnE  * 

*  WORDS  ASSOCIATED  WITH  THE  TOPIC  NAME.  _ 
♦*************************************************/ 

TAB  L  p •  ppnr ( C ) • 

DCL  C  CHAR ( 20) »  TBLCOOE  FIXED  BIN  INIT(O): 

CALL  HASHX(C  »CnOE t  5 » 1 1 1 ♦  TBLCOOE ):  _ 

IF  TBLCOOE  =  0  TMFN  DISPLAY!'  OPIC  NAME  AND  * 

I  I 'NUMERIC  CODE  WORD  **  MIC):  '  „ , 

ELSE  DI  SPLAY!  '  **  TOPIC  NOT  IN  TftfiLE  **  •  |  I  C  ) : 

END  TABLE: 


•110  5 
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07 

/**♦**  * **** *** ************ *********** *******  ******* 

*  DECLARATION  OF  thf  GLOBAL  VARIABLES  USED  * 

*  AND  DISPLAY  Oc  COMMAND  REQUESTS  TO  thf  * 

*  operator.  * 

**************************************************/ 

OCL  (L1,L2,L3,BP,PP) PTR,STNAME  CHAP(&0),IPT  CHAR(BO)VAR, 
ITRC,ENC  ,TRI ,RRR ,cLD,trl,DCtJRIT(1I , 

( !.L,LEVtTRLSI7E)PI XED  PIN: 

TRC,TRI ,ENC,ERR,flD,tbl,0CT='0'B:  LEV=0:  TBL*IZE=11: 

/***********************************************♦** 

*  INITIALIZATION  OP  THE  DICTIONARY  LIST  'Ll'.  * 

**************************************************/ 

IF  PARMS  -.=  "  THEN  DI  SPLAY! 'ENTER  OPTIONAL  PARAMETERS'! 
REPLY(IPT): 

jc  l pT  -,=  ti  then  GET  STRINGflPT)  DATA ( TRC ,TRI , FLC, 

TBL  DC T ) :  IRT  »  "  : 

CALL  IN  I T ; 

/********** **************************************** 
********************  *************  ****************** 

**  ROUTINE  POR  THE  OPERATION  OF  THE  PROGRAM.  ** 

************************* ************************** 
♦♦♦♦♦♦♦♦♦A****************************************/ 


REPLY  (STNAME) : 
ARAMETERS,»  II 


DO  WHILE! *1  'B)  : 

DISPLAY!'  '): 

DISPLAY! 'ENTER  NAME  AND  SMC  NUMBER')  REPLY (ST NAME 
DISPLAY!'  •): 

IF  TRI  THEN  01  SPLAY! 'ENTER  OPTIONAL  PARAMETERS,' 
'IF  NONE  THEN  RETURN')  REPLY(IPT): 

IP  IP T  -.=  ••  THEN  GET  STRING  (I  PT)  DATA(-RC,ENC, 

TP  I ,FLP  ,  TBL ,DC T)  :  I pt  = 

IF  DC T  THEN  call  define: 

ELSE  00 ; 

L2=P  EADS  * 

CALL  PR  I  NT! L2 ) : 

IF  AT0M(L2»  THEN  IF  SYMB0L(L2)='STOP*  THEN  FXI 
DISPLAY!'  •): 

DISPLAY!  '  A***************************** )• 

DISPLAY!'  MISTNAME): 

DI SPLAY!  '  ***************************** i j ; 


!  *  STOP  *  THEN  EXIT: 


DISPLAY!  ' 
DISPLAY!  • 
DISPLAY! • 
DISPLAY! » 
CALL  ATTRP 
DI SPLAY!  • 


CALL  PR  I  NTS! L2 I ;  END: 

IF  TBL  THEN  CALL  TABLE! »  **  TOPIC  NAME  **  •): 
DISPLAY!'  •): 

DISPLAY!'**  AT  THIS  POINT  YOUR  REQUEST  WOULD' I  I 
•  BF  PASSED  to  the  • )  : 

DISPLAY!'  TRANSLATION  AND  TABLE  SFAPCH  'll 
'ROUTINES.  **'): 

DISPLAY! »  •):  END: 

END  LIB_SFP  : 
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/************************************************** 
*********** **************************************** 

**  TABLE  PROGRAM  ** 

*************************************************** 

**************************************************^ 


TABLE:  PROC(PARMS)  OPTIONS! MAI N) : 

CCL  COOES  FILE  STREAM  ENVIRONMENT  (F(80n: 

DCL  <TRC,SWT)BIT(1)  ,<C0DE<11)  ,  ALFAKHARt^O)  , 

( I, DATA  SIZE  ,NRKF Y .T8L  SI 7E , TYPQP ,TBL CODE ) F IX ED  BIN, 

( PARMS, IPTJCHAR(2iVAR  : 

/********** *****************************'********** 

*  DECLARATION  OF  A  FILE  CONTROL  BLOCK  fnR  * 

*  USF  IN  THC  READIN  OF  THE  EXTERNAL  FILE.  _  * 
****************** ********************************/ 

DCL  1  FCB  STATIC,  . 

2  COMMAND  CHAR ( 81 .  _ 

2  FILENAME  CHAR ( B I  I NI T( • CODES • I , 

2  FILETYPE  CHAR (8)  I  MI T ( » DAT A « ) , 

2  CARDNUM  FIXED  PIN, 

2  STA TUS  FIXED  RIN, 

2  CAR  D_BUFF£R  CHAP  (BOM 

/********************** ********************* ******* 

*  PROGRAM  OPERATION  _  * 

******************  ***************»'•  ****************/ 

IF  PARMS  -»■  *•  THEN  TRC.  =  'I'B: 

ELSE  TRC  *  'O'B: 

T8LSIZE  =  11:  TBLCODE  =  0: 

DATASIZE  =  5:  NR  KEY  =  1: 

/****************************************♦********* 

*  R0UT1NF  TO  READ  THE  CODES  FILE  INTO  THE  * 

*  ARRAY  FOR  USE  DURING  THE  PROGRAM. 
**************************************************^ 

DISPLAY! 'INITIALIZATION  ^  THE  CODE  ARRAY* I : 

COMMAND  =  'ROBUF  •  ; 

DO  I  =  1  TO  T8LSIZE: 

CARONUM  =  I: 

CALL  IHEFILE(FCB) : 

IF°TRC  *THFNADl5PLAY( ’CODE f  '  !  1 1  I  I  ' )  • f I  CODE (I ) ) :  E! 

COMMAND  =  ’FINIS' ? 

DISPLAyF’COMPLETION  OF  INITIALIZATION'): 


/*********************************** *************** 

*  ROUTINE  WHICH  ALLOWS  THE  CONTINUAL  STORE,  * 

*  DELETION,  OR  RETRIEVAL  OF  ITEMS  FROM  THE  * 

*  ARRAY  UNTIL  THE  OPERATOR  IS  FINISHED.  * 

************************ ********* ****************♦/ 

Swt  =’i'B: 

D^SPl*  A  Y  (  '  E  NTfr  TYPE  OF  OPERATION:  ST0RE=0  ,  R  ETR I  EVE^l  ' 
If  ',  DELETE  =-l  ' )  RFPLYUPT): 

TYPPP  —  TpT  • 

DISPLAY! «ENTFR  I TE M  REQUIRED')  REPLY ( ALFA) : 

IF  TRC  THEN  0! SPLAY !« CALL  H  ASHX  •  )  : 

/************************************************** 

*  THE  FOLLOWING  ARE  THE  PARAMETERS  REQUIRFD:  * 
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*  ALFA  =  ITFM  UNDER  CONSIDERATION  * 

*  CODE  =  THF  TA  B  L  F  ARRAY  A0 E A  ^  * 

*  OATASI7E  =  NUMRER  np  COMPUTER  WORDS  IN  * 

*  THF  ITEM  PEI  NO  roNSTOFREn  * 

*  NRKCY  =  NUMPEP  OF  ITf-MS,  NORMALLY  ONE  * 

*  TYPOP  =  TYPE  OR  OPERATION  * 

*  TBLSI7E  =  THF  TABL=  ST  7  F  (PRIME  NUMBER)  * 

*  TRLCOOE  =  THE  TABLE  CONF I OUR ATT  ON  CODE  (0)  * 

a***************  ********************************** f 

CALL  HASHX( ALPA,CDDE,DATASI 7E .NRKPY ,TYPOR ,TRLSI ZE, 
TBLCOOEI: 

/^**** *************************** ****************** 

*  THE  POLL "WIND  ARP  THE  TABLE  CODES  WHICH  * 

*  RESULT  FROM  THE  CALL  TO  HASHX:  * 

*  0  =  OPERATION  SUCCESFULLY  COMPLETED  * 

*  1  =  TARLE  IS  PULL  * 

*  2  a  Itfm  IS  N0T  IN  THE  TARLE  * 

************************************************** f 

?fs?^n‘!Lys°ET;wi?^s?Tufv=D  m  aN0  code 

DISPLAY!^! F  FINISHED  ENTER  0,  ELSE  l'l  REPLYlIPTl! 
SWT  =  IPT:  end: 

/ t********* **********  ************************** 

*  ROUTINE  USED  to  WRITE  THE  APRAY  BACK  INTO  * 

****************** ******** ************************/ 

COMMAND  =  ‘ERASE* : 

CALL  IHEFILE(FCR) : 

COMMAND  =  'WPBUF': 

DO  I  =  i  T0#TBLS!7F: 

IPRTPCMTHEN  ’DISPLAY! *COOE ( • I  I  I ! I ' )  ’ll  CODE ( I )) : 
CARD  RIJPPFR  =  CODE  (  I)  : 

CALL-IHFF ILE( FCBI  :  AND; 

END  TABLE: 
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“OCibi 


/************************************************** 
**********  ***************************************** 

** 

**  HASHX  SURROUTINc  ** 

**  ** 
*************************************************** 
**************************************************/ 


MACRO 

HAKFY 

MACRO 

LM 

LM 

XP 

SRDA 

D 

LR 

MR 

c 

BNL 

LCR 

LR 

AP 

MEMO 

MACRO 

KEYCK 

MACRO 

LM 

CL 

BNE 

CL 

BNE 

CL 

BNE 

MEMO 

MiC?-'’ 

HCOLL 

MACRO 

AR 

CR 

BNL 

BXLE 

SP 

LP 

AP 

B 

MEND 


USED 
At  5 1 
6,7, 
6,7 
6,32 

6 ,  TE 

7.6 

6,2 

7 ,  =c 

*+6 

7.7 

12.7 

12,1 


TO  CCNSTRUCT  THE  HASH  CODE  Q IR  PL  A  CEMENT 
TPMP+2*  LOAD  THE  COL  L  T  S !  ON  CONSTANTS 
0(11)  ESTABLISH  HASH  CODE  INnpx 
REDUCE  THE  KEY  TO  ONE  WORD 

MP+12  HASH  COPE  INDEX 

HASH  TABLE  DISPLACEMENT 

'0* 

LOAD  COMPLEMENT  !F  MEOATJ vp 
5  ESTABLISH  HASH  TABLE  FNTRY  POTAT 


£RP1 

USED  tq  COMPARE  THE  INDyT  WITH  THE  STORED  ITPM 
8,10,0(11)  LOAD  THE  CIRST  THRpc  WORDS  OF  DATA 
8,0(12)  CHECK  THE  CIRST  WPRO  PQB  MATCH 

EBP1  BRANCH  IF  NOT  SAME 

9,4(12)  CHECK  THE  SECOND  WOPD 

EBP  1  BRANCH  IF  NOT  SAMP 

10,8(12)  COMPARE  THE  THIRD  W>RD  c0R  match 
EBP1  BRANCH  IF  NOT  SAME 


ERP2, EBP3 
USED  TO  COMPUTE 


4,2 

4.5 
EB°3 

7 ,4,*+6 

7.5 
12,7 
12,15 
ERP2 


THE  COLLISION  DTRRLACEMPMt 


ADO  DATA  SIZE  TD  LOWER  COLL  TONST 
CHECK  ^0  SEE  IE  WITHIN  TAPIP  ARFA 
IF  N0T  GO  TO  T ABL E  FULL  ROUTINE 


RECOMPUTE  NEW  HASH  TABLE  ENTRY 


MACRO 

MVCHA  f.Rl,£R2,EBPl 

MACRO  USED  TO  MOVE  THE  DATA  FROM  INPUT  T0  HASH  TABLE 
OR  FROM  HASH  TAB! E  TO  OUTPUT 


AR  2,1  REDUCE  NUMBER  OF  CHAR  BY  ONE 

EX  2  ,*+  8  MODIFY  THE  NUMRFR  pc  CHAR  MOVED 

B  *  +  10 

M VC  Of 1,SR1.) ,0(ER2.) 

SR  2,1  RESTORE  THE  NUMBER  OF  CHARACTERS 

AR  11,2 

BCT  3, EBP!  CHECK  TO  SCF  1  =  ALL  DATA  PROCESSED 

B  EXIT  IP  PINISHEO  BRANCH  TO  EXIT  RnuT I NE 

MEND 

MACRO 
TPLFUL  ENR 
L  1 , TE  MP+1 6 

L  2,=F*ENR.» 

ST  2,0(1) 
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♦ 

* 

* 

*  . 

A 

A 

A 

* 

A 

* 

A 

A 

A 

A 

A 

A 

A 

HASHX 


B 

MEND 


EXIT 


repro 


DuCIBL£ 


AFTER  INITIALIZATION  tHc  COLLOWTNG  ARE  THE 
REGISTER  ASS’GNMp  NTS. 

R-0  CONTAINS  A  CONSTANT  BLANK  F QR  COMPARISONS 
R  — 1  CONTAINS  A  CONSTANT  mi  NUS  0NF 
R-2  CONTAINS  THE  DATA  LENGTH  IN  *YTFS 
R-3  CONTAINS  THF  NUMBER  OF  ITEMS  T0  BF  PROCESSED 
R-4  CONTAINS  A  COLLISION  CONSTANT  ({ o«-l ) /2*®2 ) 
R-5  CONTAINS  A  COLLISION  CONSTANT  ( P#*R 2 ) 

R-ll  CONTAINS  THE  CURRENT  OATA  AnDRFSS 
P-12  CONTAINS  THE  H  A  RH  CODE  DISPLACEMENT 
R- 1 3  CONTAINS  PROGRAM  BASE  ADDRCSS 

P-14  CONTAINS  THE  DATA  RASE  AODB F SS 
R-l 5  CONTAINS  THF  HASH  TABLE  BACE  ADDRESS 


R-5  CONTAINS  A  C^l 
R-ll  CONTAINS  THE 
P-12  CONTAINS  THE 
R-13  CONTAINS  thc 
P-14  CONTAINS  THE 
P-15  CONTAINS  THF 
R-6 


TO  R-l 0  ARE  WORKING  REGISTUPES 


TEMP 


USING 

B 

DC 

DC 

STM 

IP 

CNOP 

BAL 

DROP 

USING 

DS 

ST 

ST 


C  SEC T 


*♦15 

12<  0,15) 

ALH6) 

CL6'HASHX* 

14,12, 12(  13) 

12,13 

0,4 

13, *+76 
15 

*♦13 

18F 

13, 8(  12) 
12,4( 13) 


BRANCH  AROUND  IDENTIFIER 

IDENTIFIER 
SAVE  REGISTURES 


BEGIN  PROGRAM 


* 

* 

* 

LnP  l 
L0°  2 


LM 

LM 

L 

L 

L 

L 

CNOP 

B 

DS 

STM 

SLA 

ST 

LR 

BCTR 

SRA 

MR 

SLA 

MR 

ST 

ST 

LR 

L 

L 

L 

C 

BE 

C 

BE 

B 


14,15,0(1) 
2, 6, 8(1) 
2,0<  2) 
3,0(3) 
4,0(4) 

5 , 0(  5) 

0,4 
i-  +  36 
9F 

;? » 6 ,  tp  m  p 

?,? 

2,TEMP+20 

7,5 

5,0 

5.1 

4.2 

7.2 

6.2 

5 ,  tpmp+24 
7 ,  TE  MP+2  8 
11,14 
1,=F  '-1  * 

0  ,  =C  '  * 

6, =F 'O' 
6.TEMP+9 
L  OP  1 

1 ,TFMP+5 

PELT 

RTRV 


LOAD  KEY  ADDRESS  AND  HASH  TABLE  BASE 

LOAD  PARAMETERS 

LOAD  KEY  AND  DATA  ST7E 

LOAD  NUMBER  OF  KEYS  OR  "ATA 

LOAD  TYPE  Oc  OP ( RCT=1 ,ST=0 , DEL=-1 ) 

LOAD  TABLE  SIZE  (PRIME  NUMBER) 


STORAGE  SPACE 


parameters 


KEY/DATA  SIZE  TTMPS  FOUR 

TEMP  STORE  Oc  DATA  SIZE  IN  BYTES 

ESTABLISH  COLLISION  CONSTANTS  TO 


SUBTRACT  ONE  FROM  TABLE  SIZE 
MULTIPLY  TABLF  SI7F  BY  DATA  SIZE 
MULTIPLY  TABLE  SIZE  BY  cOUR 
MULTIPLY  4-TABLE  SI7E  BY  DATA  SIZE 
U —  TEMP  STORE  CE  CONSTANTS 


STORAGE  ROUTINE 


HAKEY 

C 

BE 

r 


0,0( 12) 
*  +  16 
1,0(12) 


ESTABLISH  CONSTANT  MT WUS  ONE 
ESTABLISH  CONSTANT  BLANK 
ESTARLTSH  ZERO  CONSTANT 
CHEr K  TO  SCF  Ic  STORc  nopRATION 
BRANCH  TO  STORE  ROUTINE 
CHECK  TO  SEE  IF  DFLET  OPERATION 
BRANCH  TO  DELCTI ON  ROUTINc 
BRANCH  TO  RETRIEVF  POUTINc 


CHECK  =0P  EMPTY  SLOT 

IF  RMPTY  BRANCH  Tp  CHARACTER  MOVE 

CHECK  TO  SEE  IF  ITEM  DELETED 
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IF  DELETED  BRANCH  TO  MnvE  ROUTINE 
BRANCH  TO  FULL  SLOT  ROUT  INF 


FSL  T 

CLS2 

* 

♦ 

* 

RTRV 

LOP3 

CL  SI 
* 

* 

* 

DELT 

L0P4 


CLS3 

* 

* 

* 

TFL1 
TFL  2 

* 

* 

* 

EXIT 


PE 

B 

MVCHA 

KEYCK 

MVCHA 

HCOLL 


*+e 

FSLT 

12,11, LOP1 
CLS2 

12, 1 1 , LOP1 
L  OP2 , TCL1 


RETRIEVE  ROUTINE 


HAKEY 

KEYCK  CLS1 
MVCHA  11,1 
HCOLL  LOP3 


11, 1 2 ,RTRV 
LOP3,TFL2 


DELETION  ROUTINE 


hakfy 

KEYCK 

ST 

ST 

BCT 


CLS3 
1,0< 12) 
1,4(12) 
3, DELT 


DC  I  3  ,UC  L  I 

HCOLL  L0P4,TFL2 

TABLE  FULL  ROUTINE 

TBLFUL  1 
TBLFUL  2 

RETURN  ROUTINE 

L  13,4(13) 

LM  14,12,12(13) 
MVI  12(13),  X'PF* 
LA  15,0(0,0) 

BP  14 

END 


RESTORE  REGTSTURES 
SET  RETURN  INDICATION 
LOAD  RETURN  CODE 
RETURN 


/*********  *************  **************************** 
********************  ********  *********************** 
**  ** 

**  DICTIONARY  FILE  ** 

**  ** 

********************** ***************************** 
********  ***********  ****************  £**************/ 


U<MSPLR<S<^<PLUP>»  WE  SPLR<E<$<*<PLUP»'»  MLY<L<Y<*<-AOJP 
ADVBP»»  MION<T<0<N<^<-VPRRP  NOUNP»»>  «1  NG<I<N<G<*<PRSPT 
>»»  M4i_<6<L<*<-NnUNP  ADJP'»»  MED<E<D<*<D  AST»»  MEST<F<S 
<T<*<SIJPPL>»»  MER<E^R<*<CCWPP»»  MIC<I<C<*<:-NOUNP  ADJ°» 

»mment<m<e<n<t<*<-vfprp  mounp>>»»  ma ble<a<p<l<e<*<-veppp 

AOJP»»»  MICS<I  <C<*<-NOUNP  AD  JP>  S<*»»  MO<D<*<PAST»» 

E  <L<E<C<T<R<NOUNP  P<N<AMTC$>  D<F<*»  L<Y<S<I<S<*»»»  T 
<C<I<T<Y<*»>  /M4L»»»» 

T  <0<*<PREPP  ADVRP»  H<E<*<ADJP  DECART»  A<T<*<ppNOlJNP 
AOJP»»  E  <L  <c  <M<F  <T<P  <Y<*<NOUNP»»>»»'> 

A  <*<ADJP  INDART>  N<*<ADJP  JNDART^  L  <L<*<  ADJP>»  C<n<U<R 
<MOUMP  T</MICS»»  Q<U<T<P<VFRRP  E<*  AMD>  AMING>»»  B< 
0<U<T<*<PRFPD»'>»  M<*<VE P R°>» 

0  <B<T<A<I<N<VEPBP  *  AMING»»>  PREPP  N<*>  F<*>  U<T<*< 
-PREPP  ADVRP>»> 

T  <*<pRNOUN>  N<*<PRF°P'>  p<NHUNP  0<*  R<m<*<-mouNO  VERRP^  A 
<T<  AM I  ON  >>>■>>>  T<E<R<E<S<T<VERBP  *  $<*>  AMED»>»»  S<* 
<VERR°>» 

N  <A<T<U<R  <F  <*<NOUNP»  AMAL»»> 

S  <0<M<F<*<fiDVRO»>>  E<N<D<VEPRP  *  AMTNG»» 

R  <E<F<E<R<VEPRP  *  P<AMING  AMEO»»  P<0<R<T<N0UNP  * 

/MSPLR  >>>■>">> 

H  <A<VERB P  S<*<PR?SG»  0<*<PAST»  V<E<*<PRN3SG»  AMI NG» 
O<L<D<I<N<G<N0UNP  *  /m<  PLR>^>»V» 

L  <A<N<G<U<A<G<F<NOUNP  *  /MSPLP»»»>  !<K<E<*<VEPRP>» 
NOUNP  S <T<I  <M<G <*  /MSPLP'»>'»  N<G<U<I<S<T<A  MI  CS>»V»» 
D  <0<V5RPP  *  AMTNG^  F  <S<T<R<0<Y<F<R<NOUMP  *  /MSPLR>»»» 
»  U 

C  <A<L<C<U<L <U<S<*<NOUNP»>»>»  0<U<P<1<E<D<NPUMP  * 
AMSPLR^^^  M<P<U<T<F<R<NOUNP  *  AM$PLP>'»»»> 

F  <I<N<D<VERRP  *  AMING»» 

M  <f<*<prnchjnp>» 

W  <A<N<T<*<VERBP»»  0<U<L<D<*<VFRBP»»,>  I<T<H<*<PREPP» 
»  H<I  <C<H<*<ADJR»»  A<T <*<I  NTCP  ADVRP»>» 

Y  <0<U<PRNIJUNP  *  R<*<POSSP»»'> 

G  <I<V<VERBP  E<*>  AMI  NG»  A<V<E<VEopp  pfi<T»» 

P  <L<E<A<S<E<*<VPPnP>»»» 

B  <I<B<L<I<C<G<R<A<P<H<NOUNP  Y<*>  T<C<AMAL»>»»»»» 

V  K  X  J  0  Z 
.  <*<PUNTP» 

,  <*<PUNTC» 

?  <*<PUNTI»JT 
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